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EPO - DG 1 

AMORPHA-4, 11-DIENE «„ « 

2 7. 08. 1998 

The present invention relates to a DNA 
sequence, a polypeptide encoded by this sequence, and to 
the use of said DNA sequence and polypeptide. 

Human malaria is a commonly occurring 
5 widespread infectious disease, caused in 85 % of the 
cases by Plasmodium falciparum . This parasite is 
responsible for the most lethal form of malaria, malaria 
tropicana. Each year, malaria causes clinical illness, 
often very severe, in over 100 million people of which 

•10 eventually over 1 million individuals will die. 
Approximately 40% of the world's population is at risk of 
malaria infection (as estimated by the World Health 
Organization) . 

Malaria has traditionally been treated with 
15 quinolines, such as quinine, chloroquine, mefloquine and 
primaquine, and with antifolates. Unfortunately, most 
P . falciparum strains have become resistant to 
chloroquine, and some have developed resistance to 
mefloquine and halofantrine as well. Thus, novel 
20 antimalarial drugs to which resistant parasites are 

sensitive are urgently needed. Artemisinin, as well as 
its semisynthetic derivatives are promising candidates 
here . 

Artemisinin (Fig. 1), [3R- ( 3a , 5aS, 6S, 8a£, 
25 9a, 12S, 12 aR*) ] -Octahydro-3 , 6, 9 - trimethyl-3 , 12-epoxy-12H- 
pyrano [4 , 3- j ] -1 , 2 -benzodioxepin- 10 (3H) -one; molecular 
weight 282.35), also called arteannuin, qinghaosu or QHS, 
is a sesquiterpene lactone endoperoxide isolated from the 
aerial parts of the plant Artemisia annua L.- 
3 0 Artemisia annua L., also known as quinghao 

(Chinese) , annual or sweet wormwood, or sweet annie is an 
annual herb native to Asia. A . annua , a member of the 
Asteraceae, belongs to the tribe Anthemideae of the 
Asteroideae, and is a large herb often reaching more than 
35 2 . 0 m in height. It is usually single-stemmed with 
alternating branches. The aromatic leaves are deeply 
dissected and range from 2.5 to 5 cm in length. 
Artemisinin is mainly produced in the leaves as a 
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secondary metabolite at a concentration of 0.01 - 0.6 % 
on a dry weight base in natural populations. Artemisinin 
is unique to the plant A , annua with one possible 
exception of A. apiacea L. The A . annua used in this 
5 invention is of Vietnamese origin . 



artemisinin is a relatively expensive resource for a 
drug. Current research has thus been aimed at producing 
artemisinin at a larger scale by organic synthesis. 

10 However, because artemisinin consist of seven chiral 

carbon atoms, theoretically 2*^ = 128 isomers can be formed 
of which only one is identical to artemisinin. Because of 
this complex structure of artemisinin, production of this 
compound by organic synthesis is not profitable from a 

15 commercial point of view. 



of artemisinin may give rise to higher artemisinin levels 
in plants. To be able to interfere in the biosynthesis of 
artemisinin, the biosynthetic pathway has to be known, 
20 either completely or partially. Several attempts to 
elucidate the entire biosynthetic pathway have been 
undertaken. Until now, however, the exact pathway has 
remained largely unknown. 



25 unique pathway has been discovered which has not been 
published before. Thus, the formation of amorpha-4 , 11- 
diene (16, SS, 7S, 10Q!H-amorpha-4 , 11-diene) and the 
hydroperoxide of dihydroarteannuic acid in A. annua has 
not been reported before in literature. 

3 0 From literature it is known that terpene 

cyclases (synthases) are branch point enzymes, which 
likely play an important role in terpenoid biosynthesis. 
Overexpression of such a branch point enzyme (terpene 
cyclase) may increase terpenoid production in an 

35 organism. Factors that may influence the success of such 
an approach are , in the case of artemisinin, the number 
and nature of the subsequent biosynthetic steps leading 
to artemisinin . Figure 2 shows the postulated 



Because of its low concentration in plants. 



Genetic engineering of the biosynthetic pathway 



In the research that led to the present invention, a 
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biosynthetic pathway of artemisinin (based on literature 
and personal data) . This pathway is divided into three 
parts : 

The first part (Part I) represents the 
5 terpenoid (Isoprenoid) pathway. This pathway is a general 
pathway. Farnesyl diphosphate (farnesyl pyrophosphate) 
(FPP) , for example, is present in every living organism 
and it is the precursor of a large number of primary and 
secondary metabolites. It has been established that FPP 

10 is the precursor of all sesquiterpenes. Thus, by 
definition FPP is the precursor of artemisinin. 

Part II displays the cyclization of the general 
precursor FPP into the highly specific precursor amorpha- 
4,11-diene (further referred to as amorphadiene) , the 

15 first specific precursor of artemisinin. In this pathway 
amorphadiene synthase is a branch point enzyme, having a 
key position in the biosynthetic pathway of artemisinin. 

In part III, dihydroarteannuic acid (DHAA) , 
also called dihydroartemisinic acid, is photo-oxidatively 

20 converted into its hydroperoxide (DHAA-OOH) . This 

hydroperoxide of DHAA will spontaneously oxidize into 
artemisinin. No enzymes are involved in this part of the 
pathway and therefore it is impossible to alter 
artemisinin production by overexpression of genes 

2 5 involved in this part of the pathway. 

Cytochrome P-450 catalyzed enzymes and an 
enoate reductase are probably involved in the conversion 
of amorphadiene into DHAA, the transition state between 
part II and part III (see figure 2^) . Because no 

3 0 intermediates of this part of the pathway are known or 

present (accumulated) in detectable amounts, in the 
plant, (except arteannuic acid, also called artemisinic 
acid or 4 , 11 (13 ) -amorphadien-12 -oic acid) it is likely 
that these precursors are very rapidly converted into 
35 DHAA. A rate limiting step in this part of the pathway is 
not very likely. 

Taking al these aspects into account the most 
logical step to be altered by genetic interfering, is the 



conversion (cyclization) of FPP into amorphadiene by 

amorphadiene synthase . 

The object of the present invention is 

therefore to provide a DNA sequence which exhibits at 
5 least a 70% homology to the sequence as shown in figure 

9a, and which codes for a polypeptide having the 

biological activity of the enzyme amorphadiene synthase. 
The biological activity of the enzyme 

amorphadiene synthase relates to the conversion of the 
0 general precursor f arnesyl pyrophosphate (FPP) into the 

specific precursor amorphadiene , which is further 

converted to artemisinin in A. annua . 

By transforming a suitable host cell with the 

DNA sequence of the invention, the conversion of f arnesyl 
5 pyrophosphate (FPP) into the highly specific precursor 

amorphadiene can be increased . Suitable host cells are 

for example bacterial cells, such as E . coli , or plant 

cells such as those of A . annua > 

By adding FPP to a culture medium further 
0 comprising the polypeptide of the invention (as described 

in example I), or transformed cells, e.g. E , coli , 

comprising the DNA sequence of the invention (as 

described in examples III and IV) , FPP is converted into 

amorphadiene . 

5 In addition, several plants are capable of 

producing large amounts of FPP making them potential 
organisms for amorphadiene production. 

Transformed cells can be used either in 
disrupted form, by for example sonication, or as intact 

0 cells, as a source of amorphadiene. 

Overexpression of the amorphadiene synthase 
encoding gene in A. annua will increase artemisinin 
production, because the terpene cyclase is expected to be 
the rate limiting step. 

5 The results of the present research (postulated 

biosynthetic pathway of artemisinin) make the presence of 
a single major rate limiting step at the place of the 
amorphadiene synthase very likely* Overexpression of the 



amorphadiene synthase encoding gene can increase the 
production of artemisinin in A. annua . 

The chemical structure of the first specific 
precursor of artemisinin, a cyclization product of FPP, 

5 was not known in literature. Neither has anyone so far 
detected such a compound in A . annua . Nevertheless it was 
possible to predict a likely structure for this 
cyclization product, based on the structure of DHAA and 
arteannuic acid (Fig. 2^). The structure predicted in this 

0 way was consistent with a compound which is known in 
literature as 4 , 11 -Amorphadiene (IS, lOaH-Cadina- 
4,11-diene, incorrect name), as depicted in figure 3. 
This compound, isolated from Viguiera oblongif olia . has 
previously been described by Bohlmann et al . 

5 (Phytochemistry, vol. 23, 5, 1183-1184 (1984)). Starting 
from arteannuic acid (isolated from A. annua ) , it was 
possible to synthesize amorphadiene. Amorphadiene 
obtained in this way was in all chemical and physical 
aspects identical to amorphadiene as described by 

0 Bohlmann et al . , and this standard was used to show the 
presence of amorphadiene in a terpene extract of A. annua 
A further object of the present invention is t 
provide a polypeptide having the biological activity of 
the enzyme amorphadiene synthase, obtainable by a proces 

5 as described in example I, This polypeptide can be used 
to convert FPP into amorphadiene which subsequently can 
be converted into artemisinin in A . annua , 

Amorphadiene, produced by a suitable host 
organism transformed with the DNA sequence of the 

0 invention as precursor, can subsequently be chemically 
converted to dihydroarteannuic acid. Dihydroarteannuic 
acid can be used as an antioxidant. 

The chemical conversion of amorphadiene into 
dihydroarteannuic acid (Fig. 12) starts with the 

5 enantio-, stereo- and regioselective (anti-markownikof f ) 
hydroboration of amorphadiene with BH3, yielding a 
trialkylborane, followed by an oxidation of the 
trialkylborane with NaOH / H^O^ yielding the alcohol 




(Advanced Organic Chemistry, Jerry March, 4th Edition, 
Wiley, 1992) . A mild oxidation of the alcohol to the acid 
can be obtained by PDC (pyridinium dichromate) without 
attacking the second double bond (Fig. 12) (Organic 
5 Synthesis, M.B. Smith, 1st Edition, McGraw-Hill, 1994) . 

Many genes encoding enzymes involved in the 
biosynthetic pathway of farnesyl diphosphate are cloned 
and known in literature. For A . annua . for example, the 
sequence of the farnesyl diphosphate synthase encoding 

10 gene is known in literature (Y. Matsushita, W-K, Kang and 
V. Charlwood Gene, 172 (1996) 207-209) . A further 
approach to introduce or increase the amorphadiene 
production in an organism, is to transform such an 
organism (for example A . annua ) simultaneously with the 

15 DNA sequence of the invention with one or more genes 

involved in the biosynthesis of farnesyl diphosphate. The 
expression of a fusion protein of amorphadiene synthase 
and farnesyl diphosphate synthase may be an example here. 

(Sesqui) terpenes , such as amorphadiene, are 

20 also known as flavor and fragrance compounds in the food 
and perfume industry. In addition, terpenes play a role 
in plant- insect interactions, such as the attraction or 
repulsion of insects by plants. In addition, 
dihydroarteannuic acid, which is an intermediate in the 

2 5 metabolic route from amorphadiene into artemisinin in 

A , annua , can be used as an antioxidant. 

Amorphadiene, obtained by (over) expression of 
the DNA sequence of the invention, or by using the 
polypeptide of the invention, can be applied for these 

3 0 purposes as well . 

The plants that can be used for this invention 
are preferably plants already producing artemisinin. A 
prime example is Artemisia annua , as this species 
contains the remainder of the pathway leading to 
35 artemisinin. However, this invention may also be used for 
the production of amorphadiene in plants, which, as 
mentioned before, can be used as a flavor or fragrance 
compound or biocide, or can be converted to artemisinin. 



1 



either chemically or by bioconversion using 
microorganisms or plant cells. 

The plant that can be used for the production 
of amorphadiene is preferably a plant already producing 
5 sesquiterpenes, as these plants already have the basic 
pathway and storage compartments available, or a plant in 
which the biosynthesis of sesguiterpenoids can be induced 
by elicitation. The methods of this invention are readily 
applicable via conventional techniques to numerous plant 

10 species, including for example species from the genera 

Carum, Cichorium , Daucus , Juniperus , Chamomilla , Lactuca , 
Poaostemon and Vetiveria , and species of the inducible 
(by elicitation) sesquiterpenoid phytoalexin producing 
genera Capsicum , Gossvpium , Lvcopersicon . Nicotiana , 

15 Phleum , Solanum and Ulmus . However, also common 

agricultural crops like soybean, sunflower and rapeseed 
may be interesting candidates here. 

The invention will be further illustrated by 
the following examples and figures, but will not be 

20 limited thereto. 



EXAMPLES 
EXAMPLE 1 

25 Conversion of farnesvl pyrophosphate into amorphadiene by 
amorphadiene synthase 

A, Isolation, partial purification and identification of 
amorphadiene synthase from A . annua 

During enzyme isolation and preparation of the 

30 assays, all operations were carried out on ice or at 4*'C. 
Ten grams of frozen young leaves from greenhouse -grown 
A. annua were ground in a pre-chilled mortar and pestle in 
40 ml of pre-chilled buffer containing 25 mM MES (Ph 
5.5), 20% (v/v) glycerol, 25 mM sodium ascorbate, 25 mM 

3 5 NaHSOj, 10 mM MgCl^ and 5 mM DTT (buffer A) slurried with 
1 g polyvinylpolypyrrolidone (PVPP) and a spatula tip of 
purified sea sand. Ten grams of polystyrene resin 
(Amberlite XAD-4, Serva) were added and the slurry was 



stirred carefully for 10 min and then filtered through 
cheesecloth. The filtrate was centrifuged at 20,000g for 
20 min (pellet discarded), and then at lOO.OOOg for 90 
min. A 3 -ml subsample of the supernatant was desalted to 
5 a buffer containing 15 mM MOPSO (pH 7.0), 10% (v/v) 

glycerol, 1 mM sodium ascorbate, 10 mM MgCl^ and 2 mM DTT 
(buffer B) and used for enzyme assays/product 
identification (see below at ' B' ) . 

The remainder of the supernatant was added to 

10 12.5 g DEAE anion exchanger (Whatman DE-52) , which had 
been rinsed several times with buffer A, and stirred 
carefully for 10 min. After centrif ugation at 18,000g for 
20 min, the supernatant was decanted and the DE-52 pellet 
discarded. Proteins in the supernatant were precipitated 

15 by adding (NH^)2SO^ to a final concentration of 70%, 
careful stirring for 3 0 min, and centrif ugation at 
2 0,000g for 10 min. The resulting pellet was resuspended 
in 6 ml buffer A and desalted to buffer B. After addition 
of glycerol up to 30% (v/v) this enzyme preparation could 

20 be frozen in liq and stored at -80°C without loss of 
activity. 0.5 ml of this enzyme preparation was applied 
to a Mono-Q FPLC column (HR5/5, Pharmacia Biotech), 
previously equilibrated with buffer B without sodium 
ascorbate, with 0.1% Tween-2 0 . The enzyme was eluted with 

2 5 a gradient of 0-2.0 M KCl in the same buffer. For 

determination of enzyme activities, 50 ptL of the 0.75 -ml 
fractions were diluted 2 -fold in an Eppendorf tube with 
buffer B and 20 ^^M [^H] FPP was added. The reaction mixture 
was overlaid with 1 mL of hexane to trap volatile 

3 0 products and the contents mixed. After incubation for 3 0 

min at 30°C, the vials were vigorously mixed, and 
centrifuged briefly to separate phases. A portion of the 
hexane phase (750 /iL) was transferred to a new Eppendorf 
tube containing 40 mg of silica gel (0.035-0.07 mm, pore 
3 5 diameter G nm, Janssen Chimica) to bind terpenols 

produced by phosphohydrolases , and, after mixing and 
centrifugation, 500 /iL of the hexane layer was removed 
for liquid scintillation counting in 4 . 5 ml of Ultima 



Gold cocktail (Packard) . The active fractions were 
combined, and an assay carried out to determine product 
identity (see below) . After the Mono-Q step , the enzyme 
was separated from all other FPP-converting activities 
(Fig. 4C) . This enzyme preparation was used for the 
measurement of enzyme characteristics such as molecular 
weight and K^. The molecular weight was determined using 
size-exclusion chromatography. 200 /xL of the Mono-Q 
eluent was loaded on a Superdex 75 (H/RlO/30 , Pharmacia 
Biotech) and eluted in the same buffer as used for Mono- 
Q . Enzyme activities in 0.5 ml fractions were determined 
as described for Mono-Q, but using undiluted eluens . The 
column was calibrated using cytochrome C, ribonuclease A, 
Qf-chymotrypsinogen, ovalbumin and BSA (all from Sigma) , 
The estimated molecular weight was 56 kDa (Figure 13) . 
Enzyme-kinetics were determined using 5- and 10-fold 
diluted Mono-Q eluted enzyme preparation and [^H] -FPP 
concentrations ranging from 0.25-100/iM. for 
amorphadiene synthase was 0.6 /xM. 

B. Determination of product identity 

For determination of product identity, 20 /iM 
[^H] -FPP (Amersham; for radio-GC analysis) or 50 
unlabelled FPP (Sigma; for GC-MS analysis) were added to 
1 ml of the enzyme preparations. After the addition of a 
ImL redistilled pentane overlay to trap volatile 
products, the tubes were carefully mixed and incubated 
for 1 h at SO^^C. Boiled samples were used as controls. 
Following the assay, the tubes were vigorously mixed. The 
organic layer was removed and passed over a short column 
of aluminum oxide overlaid with anhydrous MgSO^ . The assay 
was extracted with another 1 mL of diethyl ether which 
was also passed over the aluminum oxide column, and the 
column washed with 1.5 mL of diethyl -ether . For GC- 
analysis, the combined pentane/diethyl -ether mixture was 
slowly concentrated under a stream of . 

Radio-GLC was performed on a Carlo-Erba 4160 
Series gas chromatograph equipped with a RAGA-9 0 



10 

radioactivity detectox' (Raytest, Straubenhardt , Germany) . 
Sample components eluting from the column were 
quantitatively reduced before radioactivity measurement 
by passage through a conversion reactor filled with 
5 platinum chips at 800°C. Samples of 1 jiiL were injected in 
the cold on-column mode. The column was a fused silica 
capillary (30 m x 0.32 mm i.d.) coated with a film of 
0.25 /im of polyethylene glycol (EconoCap EC-WAX, Alltech 
Associates) and operated with a He- flow of 1.2 mL min"^. 

10 The oven temperature was programmed to 70°C for 5 min, 

followed by a ramp of 5° min'^ to 210°C and a final time of 
5 min. To determine retention times and peak identities 
(by co-elution of radioactivity with reference 
standards) , about 20% of the column effluent was split 

15 with an adjustable splitter to an FID (temperature 270^^0) . 
The remainder was directed to the conversion reactor and 
radio detector. was added prior to the reactor at 3 mL 
min*^, and CH^ as a quench gas prior to the radioactivity 
detector (5 mL counting tube) to give a total flow of 36 

20 mL min'^. The major [^H] -labelled product co-eluted with 
the amorphadiene reference standard (ret. time 14 min) 
(Fig. 4B) . The second radiolabelled product is farnesol, 
the product of aspecific phosphohydrolase activity. After 
the Mono-Q step, the enzyme was separated from all other 

25 FPP- converting activities (Fig. 4C) . This enzyme 
preparation was used for the measurement of enzyme 
characteristics such as molecular weight and K, . 

GC-MS analysis was performed using a HP 5890 
series II GC and HP 5972A Mass Selective Detector 

30 (Hewlett-Packard) equipped with an HP-5MS or HP-Innowax 
column (both 3 0 m x 0.25 mm i.d., 0.2 5 fim df ) . The oven 
was programmed at an initial temperature of 70^C for 1 
min, with a ramp of 5^C min'^ to 210°C and final time of 5 
min. The injection port (splitless mode) , interface and 

3 5 MS source temperatures were 175, 290 and ISO^'C, 

respectively, and the He inlet pressure was controlled by 
electronic pressure control to achieve a constant column 
flow of 1.0 ml min"^. Ionization potential was set at 70 
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eV, and scanning was performed from 3 0-250 amu . The 
(NH^jjSO^ precipitated enzyme preparation was free of 
endogenous sesquiterpenes . GC-MS analysis on the two 
different GC-columns of sesquiterpene products generated 
5 from FPP by this enzyme preparation showed that the main 
product had a mass spectrum and retention time equal to 
that of the semi -synthetically produced amorphadiene 
(Fig. 5) , 



10 EXAMPLE 2 

Isolation and characterization of the amorphadiene 

synthase encoding gene 

A, Induction of transcription 

As revealed in part III of figure 1, DHAA is 

15 photo-oxidatively converted into DHAA-OOH. In this 

reaction a reactive form of oxygen (singlet O^) is added 
to DHAA. DHAA plays the role of an anti -oxidant , a 
scavenger of reactive oxygen species, Artemisinin is the 
stable end product of this reaction in which reactive 

20 oxygen is stored. Under stress conditions, (for example 
photo-stress, frost, drought or mechanical damage) 
reactive species of oxygen are formed in the plant. In 
response to this reactive oxygen generally plants are 
producing anti-oxidants . It is likely that A. annua will 

25 produce DHAA as anti-oxidant in response to this release 
of reactive oxygen. By exposing A . annua to stress 
conditions the transcription of the gene encoding 
amorphadiene synthase will be induced. To achieve this 
situation A . annua plants grown under climate room 

30 conditions (23 °C, 90% moisture, 3000 lux) were exposed to 
stress conditions by putting them for one hour at 
approximately 30% moisture (drought stress) and 6000 lux 
(photo stress) at 30°C. 
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B. Isolation of total RNA 

Total RNA of stress induced plants (according 
to example 2. A) was isolated from young leaves by the 
method of Verwoerd et al . (Nucleic Acids Research, 17, 6, 
5 2362 (1989)). DNase I (Deoxyribonuclease I, RNase free) 
was used to remove DNA from the RNA isolate. The DNase I 
was inactivated by exposure at 70^C during 15 minutes. 

C. cDNA synthesis 

10 The reverse transcription reaction was carried 

out in a 20 /xl reaction containing 5 fig total RNA, 0.2 /xg 
oligo (dT)^2, 0 . 5 mM each dATP, dTTP, dCTP and dGTP, 10 mM 
DTT, 2 U ribonuclease inhibitor (Gibco BRL) , first strand 
synthesis buffer (Promega) and catalyzed with 200 U 

15 Moloney murine leukemia virus (M-MLV) reverse 

transcriptase RNase H minus (Promega) . After 1 h 
incubation at 37°C the reaction was stopped by storing 
the reaction mixture at -20°C. 

20 D. PCR-based probe generation 

Based on comparison of sequences of terpenoid 
synthases, two degenerated primers were designed for two 
conserved regions. The sequence of the sense primer 
(primer A) was 5'-GA(C/T) GA(G/A) AA(C/T) GGI AA(G/A) 

25 TT(C/T) AA(G/A) GA-3 ' and the sequence of the anti sense 
primer (primer B) was 5'-CC {G/A)TA IGC (G/A)TC (G/A)AA 
IGT (G/A)TC (G/A)TC-3'. PGR was performed in a total 
volume of 100 /xl containing 0.5 ^tM of each of these two 
primers, 0.2 mM each dNTP, 1 U Super Taq polymerase / Ix 

3 0 PGR buffer (HT Biotechnology LTD, Cambridge, England) and 
2 fil cDNA. The reaction was incubated in a thermocycler 
(PTC 150, MJ-research) with 1 minute denaturation at 
95°C, 1 minute annealing at 40°C and 1 minute and 15 
seconds elongation at 72°C during 40 cycles. Agarose gel 

35 electrophoresis revealed a single specific PGR product of 
approximately 550 bp (538bp) . Such a specific 
amplification product was only obtained when using cDNA 
made of RNA isolated from stress induced plants. The PGR 
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product was made blunt by using DNA polymerase I large 
fragment (Klenow) , gel-purified and subcloned in Sma I 
digested pGEM 7Zf ( + ) (Stratagene) (Fig. 6) and E.coli DH5a 
(Gibco BRL) was transformed with this construct. The 
5 inserts of 8 individual transf ormants were sequenced and 
they all had the same sequence as shown in figure 7. 

E, cDNA Library construction 
Synthesis of the second strand of the cDNA was 

10 done analogous to the RiboClone® cDNA synthesis System 
(Promega) . After ligation with EcoR I ( Not I) adapters 
(Gibco BRL) with sequence: 

5 ' -pGTCGACGCGGCCGCG-3 ' 
15 3 • - CAGCTGCGCCGGCGCTTAA-OH-5 ' 

the double stranded DNA was ligated into X ExCell EcoR 
l/CIP (Pharmacia Biotech) . For packaging and plating of 
the cDNA library, the Ready-To-Go® Lambda Packaging Kit 
20 (Pharmacia Biotech) was used. The titer of the 

unamplified library was 1.2 x 10^ plaque forming units. 

F. Library screening 
For library screening 20 0 ng of the PGR 

25 amplified probe (Fig. 7) was gel purified, randomly 

labeled with [c^-^^P] dCTP, according to the manufacturer's 
recommendation (Random Primed DNA Labeling Kit, 
Boehringer Mannheim Biochemica) and used to screen 
replica filters of 10"^ plaques of the cDNA library plated 
30 on E . coli NM 522. The hybridization was performed for 16 
h at 68°C in 1 M NaCl, 1% SDS and 10 % PEG (5000-7000) . 
Filters were washed two times for 10 minutes at-50*=»C in 2 
X SSC with 0,1 % SDS and exposed for 16 h to a Fuji X-ray 
film at -70*^0. Clones yielding positive signals were 
3 5 isolated through a second and third round of 

hybridization. By transf ecting E . coli NP66 (Pharmacia 
Biotech) with the positive clones, plasmid releases (Fig. 
8) were obtained according to the manufacturer's 
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instructions (Pharmacia Biotech) . Sequencing of these 
positive clones yielded a sequence as revealed in figure 
8A. 

5 EXAMPLE 3 

Expression of the amorphadiene synthase encoding gene in 
E. coli BL21 (DE3) 

For functional expression the cDNA clone was 
subcloned in frame into the expression vector pET lid 

10 (Stratagene) . To introduce suitable restriction sites for 
subcloning, the gene was amplified by PGR using a sense 
primer (primer C) 5 • -GT CGA CAA ACC ATG G CA CTT ACA GAA 
G-3 • (introducing a Nco l site at the start codon ATC) and 
an anti-sense primer (primer D) 5 ' -GGAT GGATCC TCA TAT ACT 

15 CAT AGG ATA AAC G-3' (introducing a BamHI site directly 
behind the stop codon TGA) . The PGR reaction was 
performed under standard conditions. After digestion with 
BamH I and Ncol, the PCR product (Fig. 9^) and the 
expression vector pET lid were gel purified and ligated 

20 together to yield a construct as revealed in figure 9. 

To obtain expression; this gene construct (fig. 
9) , pET lid without an insert as negative control, and 
pET lid with the tobacco 5- epi -aristolochene synthase 
(TEAS) gene (Back et al . , Archives of Biochemistry and 

25 Biophysics, 315, 2, 527-532 (1994); Facchini & Chappell, 
Proc, Natl. Acad. Sci. USA, 89, 11088-11092 (1992); Back 
& Chappell, The Journal of Biological Chemistry, 270, 13, 
7375-7381 (1995)) as positive control were transformed to 
E . coli BL21(DE3) (Stratagene), and grown overnight on LB 

30 agar plates supplemented with ampicillin at 37°C. 

Cultures of 50 ml LB medium supplemented with ampicillin 
(100 ^g/ml) and 0.25 mM isopropyl-l-thio-S-D- 
galactopyranoside (IPTG) were occulated with these over 
night cultures to A^^q = 0.5 and grown for 3 h at 27 °C. The 

3 5 cells were harvested by centrif ugation during 8 minutes 
at 2000 g and resuspended in 2 ml assay buffer. An 
aliquot of 1 ml resuspended cells was sonicated on ice 
four times for 5 seconds with 30 second intervals. 
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centrifuged for 5 minutes at 4®C in a microfuge (13,000 
rpm) and the supernatant used for cyclase enzyme activity 
determinations and SDS-PAGE gel electrof orese . Expression 
of the amorphadiene synthase gene-pET lid construct (Fig. 
9) in E . coli BL21(DE3) yielded a protein of approximately 
50 to 60 kDa as shown in figure 10 lane 5 to 10. This 
agrees well to the size of amorphadiene synthase isolated 
from A . annua , which was determined to be 56 kDa (Fig, 
13) . 



EXAMPLE 4 

Conversion of FPP into amorphadiene by amorphadiene 
synthase expressed in E.coli> 

Besides the supernatant of sonicated cells, 

15 also intact cells were used in the FPP assay. The FPP 
assay, GC-RAGA and GC-MS analyses were performed as 
described previously. Figure 11 and llA are revealing the 
GC-RAGA chromatograms of the assays with intact 
transformed cells and with the supernatant of sonicated 

20 transformed cells, respectively. In both assays 

amorphadiene was produced. Identification of these assay 
products with the GC-MS gave a mass-spectrum identical to 
the mass- spectrum of the reference amorphadiene with a 
quality score of 99 % (maximum score) , mass spectra were 

25 identical to the spectra as shown in Fig. 5. No 

amorphadiene was found in assays done with the positive 
and negative controls. 



EXAMPLE 5 

3 0 Expression of amorpha-4 . 11-diene synthase in transgenic 
tobacco 

There are many ways to introduce DNA into a 
plant cell. Suitable methods by which DNA can be 
introduced into the plant cell include Aarobacterium 
3 5 infection or direct delivery of DNA such as, for example, 
by PEG-mediated transformation of protoplasts (Omirulleh 
et al.. Plant Molecular Biology, 21, 415-428 (1993)) or 
electroporation, by acceleration of DNA coated 
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microproj ectiles (for example, micropro j ectile 
bombardment) microinjection, etc. 

Because Agrobacterium tumefaciens -mediated 
transformation of Art em i s i a annua and Nicotiana tabacum 
5 with a sesquiterpene cyclase gene is known in literature 
(Vergauwe et al . , Plant Cell Reports, 15, 929-933 (1996); 
Hohn and Ohlrogge, Plant Physiol., 97, 460-462 (1991)) 
delivery of expression units (cassettes) , containing the 
amorphadiene synthase encoding gene, mediated by 

10 Aqrobacterium seemed to be a rational approach. 

There are several binary vector systems 
suitable to transfer the amorphadiene synthase encoding 
gene assembled in an expression cassette behind a 
suitable promoter (for example, the cauliflower mosaic 

15 virus 35S promoter) and upstream of a suitable terminator 
(for example, the nopaline synthase transcription 
terminator) to tobacco and/or A . annua . To mobilize the 
recombinant binary vector to the target Agrobacterium 
strain, a triparental mating procedure was carried out by 

2 0 using E . coli carrying the recombinant binary vector and a 

helper E . coli strain carrying a plasmid {pRK2013 for 
example) which was able to mobilize the recombinant 
binary vector to the target Agrobacterium strain. This 
transformed Agrobacterium strain was used for 
25 transformation of explants from the target plant species. 
Only the transformed tissue carrying a resistance marker 
(for example kanamycin- resistance , present between the 
binary plasmid T-DNA borders) regenerated on a selectable 
(for example kanamycin containing) regeneration medium. 

3 0 (According to Rogers SG, Horsch RB, Fraley RT Methods 

Enzymol (1986)118: 627-640). 

EXAMPLE 6 

Conversion of amorphadiene into artemisinin (DHAA) bv 
35 A. annua 

This assay was carried out in a way analogous 
to the method as described by Koepp et al . (The Journal 
of Biological Chemistry, 270, 15, 8686-8690 (1995)), 
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Radioactive {^H-labeled) amorphadiene was fed to leaf 
discs of A . annua . For the infiltration of amorphadiene 
into the leaf discs of A . annua the radioactive 
amorphadiene can be made water soluble by complexation 
5 with cyclodextrins , for example. Radioactive amorphadiene 
is obtained by using the FPP-assay with the transformed 
E . coli BL21(DE3) cells (carrying the cloned amorphadiene 
synthetase gene of A. annua ) . Identification of the 
product (s) made in this assay was done by radio-GC 
10 analysis in combination with MS and or GC-MS analysis. 
The expected intermediates arteannuic acid (AA) , 
dihydroarteannuic acid (DHAA) and the end product 
artemisinin are all used as references. 

15 EXAMPLE 7 

Expression of amorpha-4 , 11 -diene synthase in transgenic 
A. annua and the production of artemisinin 

Transformed A . annua plants were prepared as 
described in example 5, 

20 For the regeneration of A . annua the medium for 

callus, shoot and root induction consisted of Murashige 
and Skoog micro and macro elements including modified 
vitamins (Duchefa Biochemie, Haarlem, The Netherlands) , 4 
% (w/v) sucrose, 0,1 mg/L Indole-3 -acetic acid (lAA) , 0,1 

25 mg/L 6 -benzylaminopurine (BAP) and 0,8 % (w/v) agar. The 
pH was adjusted to 5.7 with NaOH prior to the addition 
of agar. The medium was autoclaved at 1 bar for 2 0 min. 
Transformed explants were regenerated on this medium to 
fully regenerated plants. 



EPO - DG 1 

' 2 7. 08. 1998 

CLAIMS 

1. Isolated DNA sequence encoding a polypeptide 
having the biological activity of amorpha-4 , 11-diene 
synthase , 

2 . DNA sequence as claimed in claim 1 which 
5 exhibits at least 70% homology to the sequence as shown 
in Figure 9A or the complementary strand thereof and 
which codes for a polypeptide having the biological 
activity of the enzyme amorphadiene synthase. 

• 3. DNA sequence as claimed in claim 2, which is 
10 at least 80%, preferably at least 90%, more preferably at 
least 95% homologous to the sequence in Figure 9A. 

4. DNA sequence as claimed in claims 1-3, which 
has the sequence as shown in Figure 9A. 

5. DNA sequence as claimed in claims 1-4, 

15 characterized in that it has been isolated from plants 
producing amorpha-4 , 11 -diene , for example A . annua and 
V, oblongif olia 

6 . Use of a DNA sequence as claimed in claims 
1-5 for transforming or transfecting a host cell. 

20 7. DNA construct comprising the DNA sequence as 

claimed in claims 1-5 operably linked to suitable 
transcription initiation and termination sequences. 

8. Host cell comprising a DNA sequence as 
claimed in claims 1 to 5 or a DNA construct as claimed in 

25 claim 7. 

9. Host cell as claimed in claim 8, wherein the 
cell is a bacterial cell, in particular an E . coli cell. 

10. Host cell as claimed in claim 8, wherein 
the cell is a plant cell. 

30 11. Host cell as claimed in claim 10, wherein 

the cell is derived from a plant itself producing 

sesquiterpenes . 

12. Host cell as claimed in claim 11, wherein 

the cell is a A . annua cell or a V.obloncri folia cell. 
35 13. Host cell as claimed in claim 11, wherein 

the cell is derived from a plant selected from the group 



consisting of the genera Carum, Cichorium , Daucus , 
Juniperus , Chamomilla , Lactuca . Poaostemon and Vetiveria . 

14. Host cell as claimed in claim 10, wherein 
the cell is derived from a plant in which the 

5 biosynthesis of sesquiterpenoids can be induced by 
elicitation . 

15. Host cell as claimed in claim 14, wherein 
the cell is derived from a plant selected from the group 
consisting of the genera Capsicum , Gossypium , 

10 Lvcopersicon , Nicotiana . Phleum , Solanum and Ulmus . 

16. Host cell as claimed in claim 10, wherein 
the cell is derived from a plant selected from the group 
of soybean, sunflower and rapeseed. 

17. Host cell as claimed in claims 8 and 10-16, 
15 which cell is part of a tissue or organism. 

18. Transgenic tissue, consisting at least part 
of host cells as claimed in claims 8 and 10-16. 

19. Transgenic organism, consisting at least 
part of host cells as claimed in claims 8 and 10-16. 

20 20. Polypeptide having the biological activity 

of the enzyme amorphadiene synthase in isolated form 
obtainable by isolating the polypeptide from A . annua or 
V. oblonQi folia by a process as described in Example 1. 

21 . Recombinant polypeptide having the 

2 5 biological activity of the enzyme amorphadiene synthase 

obtainable by expressing a DNA sequence as claimed in 
claims 1-5 in a suitable host cell as claimed in claims 
8-17 . 

22. Method of preparing amorphadiene, 

3 0 comprising : 

a) incubating a polypeptide as claimed in claim 
20 and 21 in the presence of farnesyl pyrophosphate (FPP) 
in an incubation medium at a suitable temperature and 
during a suitable period of time; and 
35 b) optionally isolating the arporphadiene thus 

formed. 

23. Method of preparing amorphadiene, 
comprising the steps of : 



a) transfecting or transforming a suitable host 
cell with a DNA sequence as claimed in claims 1-5 or a 
construct according to claim 7 to obtain transgenic host 
cells ; 

5 b) expressing the said DNA sequence in the pre- 

sence of farnesyl pyrophosphate (FPP) to form 
amorphadiene ; and 

c) optionally isolating the amorphadiene thus 

formed, 

10 wherein the expression level of the amorphadiene synthase 
is higher in transgenic host cells, tissues or organisms 
harboring an endogenous version of the DNA sequence than 
in non- transgenic host cells, tissues or organisms. 

24. Method of preparing arteraisinin, 
15 comprising: 

a) incubation of a polypeptide as claimed in 
claim 2 0 and 21 in the presence of farnesyl pyrophosphate 
(FPP) and the enzymes that further convert amorpha-4 , 11- 
diene to artemisinin in an incubation medium at a 

2 0 suitable temperature and during a suitable period of 

t ime ; and 

b) optional isolation of the artemisinin thus 

formed , 

25. Method of preparing artemisinin, 
25 comprising: 

a) transfecting or transforming a suitable host 
cell, tissue or organism with a DNA sequence as claimed 
in claims 1-5 or a construct according to claim 7 to 
obtain transgenic host cells, tissues or organisms; 

3 0 b) expressing the said DNA sequence in the 

presence of farnesyl pyrophosphate (FPP) ; and 

c) optionally isolating the amorpha-4 , 11- diene 
thus formed, 

wherein the transgenic host cells, tissues or organisms 
3 5 harbor the genetic information coding for the enzymes 
that further convert amorpha-4 , 11 -diene to artemisinin 
and wherein the expression level of the amorpha-4 , 11- 
diene synthase is higher in transgenic host cells. 



tissues or organisms harboring an endogenous version of 
the DNA sequence than in non- transgenic host cells, 
tissues or organisms. 

26, Source of artemisinin, comprising host 
5 cells, tissues or organisms harboring a DNA sequence as 
claimed in claims 1-5 and the genetic information coding 
for the enzymes that further convert amorpha-4 , 11-diene 
to artemisinin, which host cells, tissues or organisms 
have expressed the said DNA sequence. 
10 27. Source as claimed in claim 26, wherein the 

cells are bacterial cells or plant cells. 

28. Source as claimed in claim 26, wherein the 
cells are disrupted . 

29. Transgenic cell, tissue or organism 

15 harboring in its genome more copies of a DNA sequence as 
claimed in claims 1-5 than are present in a corresponding 
non- transgenic cell, tissue or organism. 

30. Transgenic cell as claimed in claim 29, 
which cell is an E . coli cell. 

20 31. Transgenic organism as claimed in claim 29, 

which organism is a plant itself producing 
sesquiterpenes . 

32. Transgenic organism as claimed in claim 31, 
which organism is A . annua or V. oblonaif olia . 

25 33. Transgenic organism as claimed in claim 31, 

which organism is a plant selected from the group 
consisting of the genera Carum, Cichorium , Daucus, 
Juniperus , Chamomilla , Lactuca , PoQostemon and Vetiveria . 

34. Transgenic organism as claimed in claim 29, 
3 0 which organism is a plant in which the biosynthesis of 

sesquiterpenoids can be induced by elicitation. 

35. Transgenic organism as claimed in claim 34, 
which organism is a plant selected from the group 
consisting of the genera Capsicum , Gossypium , 

35 Lvcopersicon , Nicotiana , Phleum , Solanum and Ulmus . 

36. Transgenic organism as claimed in claim 29, 
which organism is a plant selected from the group 
consisting of soybean, sunflower and rapeseed. 
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Fig. 1 . Artemisinin 
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Figure 2. Postulated biosynthetic pathway of artemisinin in A. annua 
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Fig. 3 Amorpha-4,1 1-diene 
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Fig. 4. Radio-GC chromatograms of the [^H]-FPP-assays. A: Flame Ionization Detector 
(FID) signal of amorphadiene (reference). B: Radio signals of the labeled assay 
products amorphadiene (ret. time 14 min.) and famesol (as a product of aspecific 
phosphohydrolase activity, ret. time 28 min.) obtained with crude enzyme extract. C: 
Radio signal of the "^H labeled assay product amorphadiene obtained with Mono-Q 
purified enzyme extract. 
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Fig. 5. Mass spectnim of reference amorphadiene compared wth the mass spectrum of 
the FPP assay with terpene cyclases (synthases) purified from A. annua. This comparison 
yielded a qualit>' score of 99 %, corresponding with a maximum score of identicalness. 
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Fig. 6. Probe generated by PCR and cloned into pGEM IZt, 



p Primer A "[ 

27 gat gag aat ggg aaa ttt aag gaa 
DENGKFKE 

78 ctt gag ttg tac gaa gca act tct 
LELYEATS 

129 gaa gat get ctt ggt ttt aca cga 
EDALGFTR 

180 get ttt tct aca aac ccc get ctt 
AFSTNPAL 

231 caa ccc ctt tgg aaa agg ttg cca 
OPLWKRLP 

282 ttc tat caa caa caa gat tct cat 

333 tta gag ttc aat ttg ctt cag tea 
LEFNLLQS 

384 tgc aaa tgg tgg aaa get ttc gat 
CKWWKAFD 

435 gat aga att gtt gaa tgc tac ttt 
DRIVECYF 

486 cag tat tec egg get aga gtt ttc 
OVSRAR.VF 
p Primer B 

537 ctt ata gac gac acc ttc gac get 
LIDDTFDA 



teg tta get aat gat gtt gaa ggt ttg 
SLANDVEGL 

atg agg gta cct ggg gag att ata tta 
MRVPGEIIL 

tct cgt ctt age att atg aca aaa gat 
SRLSIMTKD 

ttt acc gaa ata caa egg gca eta aag 
FTEIQRALK 

aga ata gag gcg gcg cag tac att cct 
RIEAAQYIP 

aac aag act tta ctt aaa ctt get aag 
NKTLLKLAK 

ttg eac aag gaa gag etc age eat gtg 
LHKEELSHV 

ate aag aag aac gca cct tgt tta aga 
IKKNAPCLR 

^99 99^ eta ggt tea ggc tat gag cca 
WGLGSGYEP 



ttc aca aaa get gtt get gtt ata act 
FTKAVAVIT 

1 

acg g 
T 



Fig. 7, Nucleotide acid sequence and deduced amino acid sequence of the probe (538 bp) 
generated by PGR with the primers A and B. 
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Fig. 8, Released plasmid of a positive clone isolated from the cDNA library of induced .4. 
annua. 



|- £coR I {Not I) Adapter -j 

1 aattcgcggc cgcgtcgaca aatcatgtca cttacagaag aaaaacctat 

NltSR PIRR QIMS LTE EKP 
M EcoKX-^^ Not\ 

51 tcgccccatt gccaactttc ctccaagcat ttggggagat cagtttctca 
IRPI ANF PPS IWGD QFL 

101 tctatcaaaa gcaagtagag caaggggtgg aacagatagt gaatgattta 
lYQ KQVE OGV E Q I VNDL 

151 aaaaaagaag tgcggcaact actaaaagaa gctttggata ttcctatgaa 
KKE VRO LLKE AL 'D IPM 

201 acatgccaat ttgttgaagc tgattgatga aattcaacgc cttggaatac 
KHAN LLK LID EIQR LGI 

251 cgtatcactt tgaacgggag attgatcatg cattgcaatg tatttatgaa 
PYH FERE IDH ALQ CIYE 

301 acatatggtg ataactggaa tggtgaccgc tcttccttat ggttccgtct 
TYG DNW NGDR SSL WFR 

351 tatgcgaaag caaggatatt atgttacatg tgatgttttc aataactata 
LMRK QGY YVT CDVF NNY 

401 aagacaaaaa tggagcgttc aagcaatcgt tagctaatga tgttgaaggt 
KDK NGAF KQS LAN DVEG 

451 ttgcttgagt tgtacgaagc aacttctatg agggtacctg gggagattat 
LLE LYE ATSM RVP GEI 

501 attagaagat gctcttggtt ttacacgatc tcgtcttagc attatgacaa 
ILED ALG FTR SRLS IMT 

551 aagatgcttt ttctacaaac cccgctcttt ttaccgaaat acaacgggca 
KDA FSTN PAL FTE IQRA 

601 ctaaagcaac ccctttggaa aaggttgcca agaatagagg cggcgcagta 
LKO PLW KRLP RIE AAQ 

651 cattcctttc tatcaacaac aagattctca taacaagact ttacttaaac 
YIPF YOO QDS HNKT LLK 

701 ttactaaatt agagttcaat ttgcttcagt cattgcacaa qaaagagctc 
l' A K L E F N L L 0 S L H ff E E L 

751 acTccatarqt qcaaatggtg gaaagctttc gatatcaaga araacgcacc 
'S h' 7 'C K W W K A F D I K N A 

801 CLciztaaqa gatagaattq ttqaatacta cutttaggaa czaaqttcag 
F 'C L 'r D R I' v" E "c V F 'W "c 1"g S 

851 gciatgagcc acaqtattcc cqggctagaa rttrcrtcac aaaagctqtt 
G V E P Q Y S 'R A V F F T !<" a" V 



901 gctgttataa ctcttataga tgacacttat gatgcgtatg gtacttatga 
AVI TLI DDTY DAY GTY 

951 agaacttaag atctttactg aagctgttga aaggtggtca attacatgct 
EELK IFT EAV ERWS ITC 

1001 tagacacact tccagaatac atgaaaccga tatacaaatt attcatggat 
LDT LPEY MKP lYK LFMD 

1051 acatacacag aaatggaaga atttcttgca aaggagggaa gaacagatct 
TYT EME EFLA KEG RTD 

1101 atttaactgc ggcaaagaat ttgtgaaaga gtttgttaga aacctgatgg 
LFNC GKE FVK EFVR NLM 

1151 ttgaagcaaa atgggcaaat gagggacaca taccaaccac tgaagagcat 
VEA KWAN EGH IPT TEEH 

1201 gatccagttg taatcattac tggcggtgct aacctgctta caacaacttg 
DPV VII TGGA NLL TTT 

1251 ttatcttggc atgagtgata tattcacaaa agagtctgtc gaatgggctg 
CYLG MSD IFT KESV EWA 

1301 tctctgcacc tcctcttttt agatactcag gtatacttgg tcgacgccta 
VSA PPLF RYS GIL GRRL 

1351 aatgatctca tgacccacaa ggccgagcaa gaaagaaaac atagttcatc 
NDL MTH KAEQ ERK HSS 

1401 gagccttgaa agttatatga aggaatataa tgtcaatgag gagtatgccc 
SSLE SYM KEY NVNE EYA 

1451 aaaccttgat ttacaaggaa gtagaagatg tgtggaaaga tataaaccga 
OTL lYKE VED VWK DINR 

1501 gagtacctca caactaaaaa cattccaagg ccgttattga tggctgtgat 
EYL TTK NIPR PLL MAV 

1551 ctatttgtgc cagtttcttg aagttcaata tgcaggaaag gataacttca 
lYLC QFL EVQ YAGK DNF 

1601 cacgtatggg agacgaatac aaacatctca taaagtctct actcgtttat 
TRM GDEY KHL IKS LLVY 

1651 cctatgagta tatgactacc aatccttcgt gcatagccta tcaattatat 
PMS I-L PILR A-? INY 

1701 tgaaagggtt aactatgcac gtctctatgg agagaatttc tcaagctatt 
lERV NYA RLY GENF SSY 

1751 tqacatutct tactgqcaat aataaatcag acgcataaaa :i:atattgaa 
L' V F l' A 'g N ri K S DA- N C I E 

1801 ctatatccca ataactattt aaaattatta tacascta^s a'attcaaca 
L Y' a' D 'S Y L K L L Y tl - N I Q 

1851 atggtaitat acttttactt tgtacaaaaa caaaagtaca ciactattat 
0 W Y Y TFT L V K ' 5 K S T ILL 



1901 gtaacatttt agttctatga tactttagtt acgaatcggc ttatatacat 

CNI LVL - YFS YES AYIH 

1951 tgatacactt ttatgcagaa aaccctagta aataaaaagt cgatatcttg 

-YT FMQ KTLV NKK SIS 

2001 tactacacat atcgcacgaa tttccgtttg ccgtttgtat tttacgatat 

CTTH lAR ISV CRLY FTI 

2051 gttatttaat gaatatgttt catgtggttg ttgcttaaaa aaaaagtcga 

CYL MNMF HVV VA - KKISR 

p f/oti y £coRI ^ 

2101 cgcggccgcg aa 
R G R I E 
£coR I (yv<?/ I) Adapter 



Fig. 8"^. Nucleotide acid sequence and deduced amino acid sequence of a positive clone 
(amorphadiene synthase encoding gene) isolated from the cDNA library of induced^. 
annua. The sequence is flanked with EcoR I {No( I) adapters (Gibco BRL). 
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Fig. 9. Part, between start and stop codon(flanked resp. with a Nco I and BamH I site), of 
the amorphadiene synthase encoding gene cloned i^Nco I, BamW I) in the expression 
vector pET I Id. 



|- Sco I 1 

7281 ccatggcact tacagaagaa aaacctattc gccccattgc caactttcct 

TMA LTEE KPI RPI ANFP 
Start codon 

50 ccaagcattt ggggagatca gtttctcatc tatcaaaagc aagtagagca 

PSI WGD QFLI YQK QVE 

100 aggggtggaa cagatagtga atgatttaaa aaaagaagtg cggcaactac 
QGVE OIV NDL KKEV RQL 

150 taaaagaagc tttggatatt cctatgaaac atgccaattt gttgaagctg 
LKE ALDI PMK HAN LLKL 

200 attgatgaaa ttcaacgcct tggaataccg tatcactttg aacgggagat 
IDE IQR LGIP YHF ERE 

250 tgatcatgca ttgcaatgta tttatgaaac atatggtgat aactggaatg 
IDHA LQC lYE TVGD NWN 

300 gtgaccgctc ttccttatgg ttccgtctta tgcgaaagca aggatattat 
GDR SSLW FRL MRK QGYY 

350 gttacatgtg atgttttcaa taactataaa gacaaaaatg gagcgttcaa 
VTC DVF NNYK DKN GAF 

400 gcaatcgtta gctaatgatg ttgaaggttt gcttgagttg tacgaagcaa 
KQSL AND VEG LLEL YEA 

450 cttctatgag ggtacctggg gagattatat tagaagatgc tcttggtttt 
TSM RVPG EII LED ALGF 

5 00 acacgatctc gtcttagcat tatgacaaaa gatgcttttt ctacaaaccc 
TRS RLS IMTK DAF STN 

550 cgctcttttt accgaaatac aacgggcact aaagcaaccc ctttggaaaa 
PALF TEI ORA LKQP LWfC 

600 ggttgccaag aatagaggcg gcgcagtaca ttcctttcta tcaacaacaa 
RLP RIEA AQY IPF YQOO 

650 gattctcata acaagacttt acttaaactt gctaagttag agttcaattt 
DSH NKT LLKL AKL EFN 

700 gcttcagtca ttgcacaagg aagagctcag ccatgtgtgc aaatggtgga 
LLQS LHK EEL SHVC KWW 

750 aagctttcga tatcaagaag aacgcacctt gtttaagaga ragaattgtt 
KAF DIKK NAP CLR DRIV 

800 gaatgctact tttggggact aggttcaggc tatgagccac agtattcccg 
ECY FWG LGSG YEP QYS 

850 ggctagagtt ttcttcacaa aagctgttgc tgttataact cttatagatg 
RARV FFT KAV AVIT LID 



900 acacttatga tgcgtatggt acttatgaag aacttaagat ctttactgaa 
DTY DAYG TYE ELK IFTE 

950 gctgttgaaa ggtggtcaat tacatgctta gacacacttc cagaatacat 
AVE RWS ITCL DTL PEY 

1000 gaaaccgata tacaaattat tcatggatac atacacagaa atggaagaat 
MKPI YKL FMD TYTE MEE 

1050 ttcttgcaaa ggagggaaga acagatctat ttaactgcgg caaagaattt 
FLA KEGR TDL FNC GKEF 

1100 gtgaaagagt ttgttagaaa cctgatggtt gaagcaaaat gggcaaatga 
VKE FVR NLMV EAK WAN 

1150 gggacacata ccaaccactg aagagcatga tccagttgta atcattactg 
EGHI PTT EEH DPVV IIT 

1200 gcggtgctaa cctgcttaca acaacttgtt atcttggcat gagtgatata 
GGA NLLT TTC YLG MSDI 

1250 ttcacaaaag agtctgtcga atgggctgtc tctgcacctc ctctttttag 
FTK ESV EWAV SAP PLF 

1300 atactcaggt atacttggtc gacgcctaaa tgatctcatg acccacaagg 
RYSG ILG RRL NDLM THK 

1350 ccgagcaaga aagaaaacat agttcatcga gccttgaaag ttatatgaag 
AEO ERKH SSS SLE SYMK 

1400 gaatataatg tcaatgagga gtatgcccaa accttgattt acaaggaagt 
EYN VNE EYAQ TLI YKE 

1450 agaagatgtg tggaaagata taaaccgaga gtacctcaca actaaaaaca 
VEDV WKD INR EYLT TKN 

1500 ttccaaggcc gttattgatg gctgtgatct atttgtgcca gtttcttgaa 
IPR PLLM AVI YLC QFLE 

1550 gttcaatatg caggaaagga taacttcaca cgtatgggag acgaatacaa 
VQY AGK DNFT RMG DEY 

j- BamH I -j 

1600 acatctcata aagtctctac tcgtttatcc tatgagtata tgaggatcc 
KHLI KSL LVY PMSI - GS 

Stop codon 



Fig. 9"^. Nucleotide acid sequence and deduced amino acid sequence, of the amorphadiene 
synthase encoding gene, between stan and stop codon, (flanked with a ,\'co I and BamH I 
sire) obtained by PGR with the primers C and D. 
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Fig. 10, SDS-PAGE gel: Mw = low Molecular Weight marker (Pharmacia Biotech), lane 1 and 2 
rcsp. pellet and supernatant pET lid (negative controlc), lane 3 and 4 resp. pellet and supernatant 'I'HAS 
in pET ] Id (positive controlc). lane 5.7,9 and 6,8.10 rcsp. pellet and supernatant amorphadienc synthase 
in pET 1 Id. All constructs were expressed in E. coli BL21 (D£3).The lanes with the pellet fractions of 
TEAS in pET 1 Id (positive controie) and amorphadiene synthase in pET 1 Id showed a clear spot which was 
not present in the negative controlc pET lid. 
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Fig. 1 1. A: Flame ionization detector (FID) signals of amorpha-4,1 l-diene and famesol 
(references). Radio-GC chromatograms of the ['H]-FPP-assays with, B: intact 
BL21(DE3) cells and C: the supernatant of sonicated BL2l(DE3) cells, both transformed 
with the amorphadiene synthase encoding gene in the expression vector pETl Id. 
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Fig. 12 Hypothetical chemical synthesis of dihydroaeteannuic acid using amoqpha-4,1 1-diene as a precursor. 
The reaction consist of an enantio-. stereo- and regio selective (anti-markownikoff) hydroboration of 
amorphadicne with BH3 followed by an oxydation of the formed trialkylboranes with NaOH/H202 yielding 
the alcohol. A mild oxidation of the alcohol to the acid can be obtained with PCD (pyridinium dichromate) 
without attacking the second bouble bond. 
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Fig. 13 Determination of the molecular weight of amorpha-4,1 1-diene synthase by size- 
exclusion chromatography (gel filtration). (-*-): activity curve; (-^ -)- molecule weight 
markers: ( — ): molecular weight calibration line. 



